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@ Pipelined fast fourier transform processor. 



\ A pipeline Fast Fourier Transfonn arrangement includes a cascade of four Butterfly Aritiimetic Units 
(BAU). Weighting and control signals are coupled to each BAU in tine cascade. A multiplexed memory 
arrangement operated in a "ping-pong" manner stores four-stage partial FFT signal as It ife getierated, 
and retums it to the first BAU in tiie cascade for subsequent passes to generate multiple-stage FFT 
signals. Each BAU includes local memories, cyclically fed and addressed to generate temporal offsets, 
which differ aniohg tiie processors of the cascade. 
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Background of the Invention 

This inv ntion relates to fest Fourier transfonn (FFT) processors, and particularly to those processors used 
for pipeline FFT operations for real-time or quasi-reai time op ration. 

5 Many signal transmissions occur in environm nts in which they are conupt d by broad-band noise, which 

tends to obscure the signal. The Fourier transform of signal corrupted by noise often improves the detectability 
of the signal in the presence of such noise, in general, the Fourier transform of a time-varying signal used for 
communication or detection resuite in significant signal-related amplitude concentrated within a comparatively 
narrow frequency range. Specific applications of Fourier transfonms include ranging and detection systems 

10 such as radar and sonar. 

The fast Fourier transform (FFT) is a method which can be used with certain processors for reducing the 
time or computational effort involved In finding a discrete Fourier transform (DFT) in a digital processing system, 
as described in the article "A Guided Tour of the Fast Fourier Transfomi," by Bergland. published at pp. 41-51 
in the July 1969 issue of IEEE Spectrum. For systems such as radar and sonar which are used for guidance 

IS of ships and aircraft. It is important that the location display of targete be performed substantially real-time. In 
this context real-time means that the display should coaespond substantially to the cunnent position of the 
targets, where curency is related to the speed of motion of the target or the vehicle in which the display is moni- 
tored. Thus, a delay of one second in the display of a ship might be tolerable, where the display is used solely 
for navigation, whereas for detection or guidance of missfles, much shorter processing times would be neces- 

20 sary. 

Short processing times are often associated with processing by the pipeline technique, as opposed to batch 
processing. In the pipeline technique, sensor data continuously or almost continuously flows through a pro- 
cessor to produce a continuous stream of processed resuite. delayed only by the processor delay. A pipelined 
FFT processor is described in U.S. Patent No. 3.81 6.729 issued June 1 1 , 1 974 in the name of Worics. The Worics 

25 pipeline processor implements the Cooley-Tukey FFT algorithm by cascading a plurality of units, each including 
a plurality of processors, each processor of which includes a multiplier and two summers, together with switch- 
ing and delay circufte. The pipeline processor further includes a rotation vector memory and control logic for 
controlling the operations of the various cascaded units and processors. The Worics system is disadvantageous 
because at each step of the butterfly processing a separate processor is required for each node, as for example 

30 FIGURE 1 of Worics illustrates a 3-stege system to produce an 8-point FFT. where the number of points may 
be calculated as 2^=8. The Worics system requires one processor for each of the three steps, for a total of three 
processors. The amount of memory associated with each processor grows geometrically at each step of the 
processing, as Illustrated for example In FIGURES 1 and 2 the article "Implementation Of A Pipeline FFT", by 
Sheats and Vickers. published NEREM 70. and in the article "A Pipeline Fast Fourier Transform" by Groginsky 

35 & Worics. published Eascom 1969. so the amount of memory required may become large when several steps 
of processing are involved. 

An improved FFT processor Is desired, in which the memory requiremente for the processors are equalized. 

Summary of the Invention 

A Butterfly Arithmetic Unit (BAU) for use In conjunction with a cascade of other similar unite in a pipeline 
FFT processor includes a multiplier for weighting one of the Informatton signals and for applying the weighted 
signal to a summing circuit. The summing circuit sums another infonmation signal with the weighted information 
signal from the multiplier to produce signals which are applied to a temporary store. The temporary stores in 
45 each BAU have the same capacity. The temporarily stored signals are read in pairs to produce output signals. 
The output signals are applied to the next Butterfly Arithmetic Unit in the cascade. In a particular embodiment 
of the invention, each BAU includes four registers for use as temporary stores, each of which has sbcteen 
addressable memory locattons, each of which is 24 bite wide to accommodate the input data format 

so Description of the Drawing 

FIGURE 1 is a simplrfi d block diagram of an FFT processor according to the invention including a cascade 
of Butterfly Arithmetic Unit processors (BAU). memories and a feedback arrangement; 
FIGURE 2 is a simplified block diagram of one of the Butterfly Arithmetic Unite of the cascade of FIGURE 
55 1; 

FIGURES 3a and 3b tabulate control states of summers and memory write address pointers of a BAU in 

the first position In a cascade of BAUs for real and imaginary information, rasp cth^ely; 

FIGURES 4a and 4b tabulate control states of summers and memory write address point rs of a BAU in 
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the second position in a cascade of BAUs for real and imaginary information, respectively; 

FIGURES 5a and 5b tabulate control states of summers and memory write address pointers of a BAU in 

the third position in a cascade of BAUs for real and imaginary information, respectiv ly, 

FIGURES 6a and 6b tabulate control states of summers and memory write address pointers of a BAU in 

5 the fourth position of a cascade of BAUs for real and imaginary information, respectiveiy; 

FIGURES 7a and 7b, FIGURES 8a and 8b. FIGURES 9a and 9b, and FIGURES 10a and 10b. are tables 
listing multiplexer configurations, memori s to be read, and the addresses to be read in those memories, 
during operation of the arrangement of FIGURE 2, for BAUs which are first second, third and fourth, re- 
spectively, in a cascade of BAUs such as that of FIGURE 1; 

10 FIGURE 1 1 a is a simplified flow chart illustrating logic flow for producing reshuffled write address pointers 
for certain memories illustrated In FIGURE 1 for reshuffling data after the first pass through the cascade 
of FIGURE 1 and FIGURES lib, 11c, and lid list the order of the addresses for 64-polnt. 128-pointand 
256-point FFTs, respectively; 

FIGURE 12 is a simplified flow chart Illustrating logic flow for producing reshuffled write address pointers 
IS for certain memories Illustrated in FIGURE 1 , for reshuffling data after two passes through the cascade of 
FIGURE 1 , and FIGURES 12b, 12:c. and 12d list the order of the addresses for 64-point and 128-point and 
256-point FFT, respectively; 

FIGURE 1 3a, b and c are the reshuffling address sequences for memories of FIGURE 1 after first, second 
and final passes through the cascade of FIGURE 1 for 512-point FFT, corresponding to 9 stages of pro- 
20 cessing; and 

FIGURES 14a, b and c are the reshuffling address sequences for memories of FIGURE 1 after first, second 
and final passes through the cascade of FIGURE 1 for 10-stage processing (1024 point FFT). 

Description Of The Invention^ 

25 

In FIGURE 1. four substantially identical Butterfly Arithnietic Unfe (BAU) 10. 12, 14 and 16 are cascaded. 
Each BAU has two inputs and two outputs. BAU 10 receives data designated "A" and "B" at its input data paths 
or ports 1 8 and 20, respectively, the data received at ports 1 8 and 20 is a continuous stream of similariy clocked 
digital data. Each set of data fe a stream of complex numbers representing samples of a wavefonm. Con- 
so ventionally, the data received at ports 1 8 and 20 represents first and second halves of the sampled wavefonm, 
respectively. A weighting and control circuit illustrated as a block 22 produces weighting signals and timing sig- 
nals for application to each of BAU 1 0, 1 2, 14 and 16. The weighting signals or coefficients produced by weight- 
ing and control block 22 are sines or cosine values of angles, where the magnitudes of the incremental angles 
are inversely related to the number of points in the FFT. The selection of the particular coefficients applicable 
35 to particular data points is well known to those skilled in the art Butterfly Arithmetic Unit 1 0 processes the data 
received at ports 18 and 20 in accordance with weighting signals received over a data path 34, and generates 
a pair of output signals, which are applied over paths 24 and 26 to BAU 12. BAU 1 2 similariy receives weighting 
signals over a further path 36 and processes its input signals to pnxJuce signals on a further pair of outputs 28 
and 30, which are applied as inputs to BAU 1 4. BAU 14 processes the input signals in conjunction with weighting 
40 signals received over a data path 38 to produce outputs on data paths 32 and 34 which are applied to the last 
BAU 16. BAU 16 processes the signals in conjunction with weighting signals received over data path 40 from 
> weighting and control blocks 22. Butterfly Arithmetic Unit 16 produces output signal on output data paths 42 
and 44. The output signal on data paths 42 and 44 represents a four-stage (16-point) FFT signal. Either more 
or less processing may be required, as described below. 
45 A multiplexer 45 in FIGURE 1 is coupled to digital RAM-type memories 46, 48, 50 and 52. The read and 
write addresses of memories 46, 48, 50 and 52 are controlled by address generators 66, 68, 70 and 72. re- 
spectively. A feedback path is completed by data paths 62 and 64 which extend firom multiplexer 45 to input 
ports 18 and 20, respectively, of BAU 10. 

Memories 46, 48, 50 and 52 are coupled to system input/output data paths or ports 6, 8, 7 and 9, respect- 
so ively, of a bus 4. by which streams of input data can be applied to the memories for buffering and for coupling 
' by multiplexer 45 and by paths 62 and 64 to the cascade of processors, and by which processed data produced 
at the output of BAU 16 and coupled to one or more of memories 46, 48. 50 and 52 may be stored therein for 
coupling to utilization means (not illustrated) by way of the associated output ports. Memories 46 and 48 may 
be considered to constitute a pair, and memories 50 and 52 likewise constitute a pair. 
55 In operation, streams of docked data words may be applied by way of paths 6 and 8 to memories 46 and 
48, and may be temporarily stored therein if the incoming data rate exceeds the processing rate of the BAU 
processors or if the processing requires more than four stages of processing, as described below. The data is 
applied through multiplex r 45 to data paths 62 and 64, for application as "A" and "B" data, respectively, to 
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inputs 18 and 20. respectively, of BAU 10. 

BAU 1 0 processes the data, as described in detail below, to produce a pair of output signals on data paths 
24 and 26 for application as A and B data, respectively, to BAU 12, the s cond processor in the cascade. BAU 
12 processes its input data, and produc s output data on data paths 28 and 30, which become the A and B 

5 data inputs for BAU 14. BAU 14 similarly process s Its input data and produces outputs on data paths 32 and 
34. The four-stage. 1 6-point FFT signals produced at the output from BAU 1 6 on data paths 42 and 44 may be 
coupled by multiplexer 45 to memories 50 and 52. respectively, where they are stored. The storage of data in 
memory pairs 46. 48 and 50. 52 is under the control of address generators 66 and 68, 70. 72 respectively. 
For four-stage FFT (1 6-point FFT) processing, the processed signals produced on date paths 42 and 44 

10 by BAU 16 require no further processing, and may be coupled through multiplexer 45 to memories 50 and 52 
and bidirectional data paths 7 and 9. In those cases in which more than four stages of processing is required, 
the processed output signal from BAU 16 is applied to memories 50 and 52, and read out. generally in a 
rearranged order, back through multiplexer 45 and data paths 62, 64 to input ports 18 and 20. respectively, of 
BAU 1 0. Further stages of processing are then perfonmed. For example, if five-stage FFT processing is desired. 

IS the date from memories 50 and 52 applied for a second pass through BAU 10 is processed in BAU 10. and 
the desired 5-stege processed signal becomes available on its output date paths 24 and 26. Since no further 
processing is required, the processed signals are "flushed" to date paths 42 and 44 by bypass paths (not illus- 
trated) which bypass BAUs 12, 14 and 16. As is known in the art. such bypasses may be required to have delays 
con-esponding to the processing delays of the BAUs in order to maintain synchronization. If six, seven or eight 

20 stages of FFT processing is desired, the output signals from BAU 16 resulting from a . first pass through the 
four steges is temporanly stored In memories 50 and 52, read out in reanranged or reshuffled order as described 
in more detail below, and applied through multiplexer 45 to the inpute of BAU 10 to begin a second pass through 
two, three or all four stages of the cascade of BAUs. respectively. 

For more than eight steges of processing (256-point FFT). the intemiediate products produced at the output 

25 of BAU 16 are altemately applied by multiplexer 45 to memory pairs 46. 48 and 50. 52 in a "ping-pong" fashion. 
The intermediate producte stored in each memory pair are reshuffled as mentioned above, and reapplied by 
way of multiplexer 45 and date paths 62 and 64 to the inputs of BAU 1 0. For example, for 1 0-stege processing, 
the input signals applied by way of input date paths 6 and 8 are temporarily stored in memories 46 and 48, and 
applied through multiplexer 45 and by way of date paths 62 and 64 to BAU 10 to stert processing by a first 

30 pass through four steges of BAU. The output of BAU 16 on this firet pass is coupled through multiplexer 45 to 
memories 50 and 52. The writing and reading of memories 50 and 52 reshuffles the date for application, back 
through multiplexer 45 and date paths 62. 64 to the inpute of BAU 1 0 to stert a second pass through the cascade 
of BAU processore. The output of BAU 1 6 at the end of this second pass represente 8-stege processing (256 
point FFT). so two more stages of processing are required. Memories 50 and 52 may still be reading date to 

35 the input of BAU 10. so multiplexer 45 is configured to couple the output of BAU 16 to memories 46 and 48. 
Memories 46 and 48 reshuffle the order of the date, and couple the reshuffled date back through multiplexer 
45 and date paths 62. 64 to the input of BAU 10. Two more steges of processing are accomplished in BAUs 
10 and 12. whereupon the desired lO-stege processing is completed. The outputs of BAU 12. on date paths 
28 and 30, are flushed past BAUs 14 and 16, and are applied through multiplexer 45 to nrtemories 50 and 52 

40 where the signals are made available to an external utilizing apparatus (not illustrated). 

While in the above examples the final output date is made available in memories 50 and 52, it may as readily 
become available in memories 46, 48. 

The aforementioned Worics system includes three steges. so it can perform an 8-point FFT. The feedback 
according to the invention allows the four-processor cascade to act as a 4N stage system or any number there- 

45 between, where N refers to the number of recirculations. Thus, any number of steges of processing may be 
accomplished with only four steges of processor, which is especially advantageous considering that the menrv 
ory requiremente for additional stages grow geometrically. In general, the arrangement according to the inven- 
tion may perform any number of steges of processing no matter how large, with relatively modest BAU 
processor internal memories, whereas Worths* arrangement appears to require increases in internal memory 

so size which may be impractical at the present state of the art above about 12 steges of processing. Also, the 
processor internal memory requirements vary among the Worte processors, depending on their position in the 
cascade. It should be noted that the memories external to the BAU processore are commensurate as betwe n 
WORKS* arrangement and arrangemente according to the invention. 

FIGURE 2 illustrates details of one of the BAU processore according to the inv ntion. such as BAU pro- 

55 cesser 10 of FIGURE 1. In FIGURE 2. elemente con-esponding to those of FIGURE 1 are designated by the 
same reference numerals. In FIGURE 2. date amving at BAU 10 by way of input ports 18 and 20 indud s real 
and imaginary porttons. The weighting signal received over date path 34 also includes real and imaginary por- 
tions. 
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The data applied to port 20 is designated B. The B data is divided into its real (BR) and imaginary (Bl) por- 
tions, and the real portion BR is applied to a first input of a multiplier pg 201 . The real portion BR of the B data 
is also applied to a first input of a multiplier 203. Multiplier 201 receives at its second input the r a! portion of 
the weighting signal (WR) applied to BAU 1 0 ov r data path 34. A second input of multiplier 203 also receives 

5 from data path 34 the imaginary portton Wl of the weighting signal. The Imaginary portion of the B data (Bl) 
from input port 20 is applied to first input ports of multipliers 202 and 204. The real portion WR of the weighting 
signal is applied to a second input of multiplier 204, and the imaginary portion Wl of th weighting signal is 
applied to a second input of multiplier 202. Multipliers 201 and 202 multiply together their respective inputs to 
generate outputs which are applied to a subtracting or differencing (-) circuit illustrated as a block 206. Multip- 

10 liers 203 and 204 multiply together their respective inputs and apply their outputs to an adding or summing (+) 
circuit illustrated as a biodc 208. The signals produced at the output of differencing circuit 206 represents the 
B data multiplied by the W data (BW), each with real and imaginary portions. The output of differencing circuit 
206 represents the real component (BWR) of product BW, given by the equation: 

BWR = (BR » WR) - (Bl » Wl) (1) 

IS where • represents multiplication. The output of summing circuit 208 is the imaginary component (BWI) of pro- 
ductBW: 

BWI = (BR • Wl) + (Bl * WR) (2) 
The data applied to input port 18 is designated A data. The A data applied to input port 18 is divided into 
. real (AR) and imaginary (Al) portions. The real portion AR is applied to firat inputs of controllable summers 210 
20 and 212. together with the BWR output of differencing circuit 206. The imaginary portion of the A data (Al) Is 
applied to inputs of controllable summere 214 and 216. together with BWI signals from summing circuit 208. 
Summing circuits 210 and 212. 214 and 216 controllably sum together their respective input signals as des- 
cribed below, and apply the summed signals to inputs of menriories illustrated as blocks 21 8. 220. 222 and 224, 
respectively, for temporary storage therein. At least some of the signals applied to inputs of memories 218. 
25 220, 222 and 224 are conceptually applied at sequential meimory locations. Howeyer, the memory locations 
are read in a manner described below, so that the stored information Is read relative to the storage sequence 
in a manner which is delayed and altemated between memory outputs. 

Each memory 218-224 has its N-bit output bus split to produce two identical N-bit outputs, designated L 
(left) and R (right). Thus, when a memory is read Wentlcal signals appear at its L and R outputs. As illustrated 
30 in FIGURE 2, memory 21 8 includes an L output and an R output, which are separately coupled to first inputs 
of multiplexers (MUX) 226 and 228, respectively. Similariy, the L and R outputs of memory 220 are separately 
applied to second inputs of MUX 226 and 228, respectively. The L outputs of memories 222 and 224 are sepa- 
rately applied to inputs of MUX 230. and the R outputs of memories 222 and 224 are separately applied to inputs 
of MUX 232. 

35 Multiplexers 226, 228, 230 and 232 merely couple the L and R outputs of the menK>ries with which they 
are associated to a single output line. For example, MUX 226 selects the L signal from memory 218 or 220. as 
. available, and couples it onto processor output data path 26. Similarly. MUX 228 selects the R output of 
memories 21 8 and 220. respectively, as avaHable. and couples tiiem onto processor output date patii 24. MUX 
230 and 232 perform the same functions for memories 222 and 224. Output date paths 24 and 26 represents 

40 the merged outpute of multiplexers 228. 230 and 226, 232. respectively. Control circuite 234 and 236 control 
summere 210-216, meirories 218-224, and multiplexere 226-232. respectively, as described below. 

Each BAU 10, 12, 14 and 16 of FIGURE 1 receives information by a patii (not illustrated) which identifies 
to the BAU ite location in the cascade. This may be accomplished by indexed connectore for each location, by 
multiposition switches which are programmable at ttie time the cascade is assembled, by links which are cut 

45 for ttie same purpose, or the like. This infonnation allows controllers 234 and 236 to Implement different control 
sequences, depending upon the position of the BAU in tiie cascade. 

FIGURES 3a and 3b represent the control algorithms of control blocks 234 and 236 of FIGURE 2, respect- 
ively, for the case in which the Butterfly Arittimetic Unit is in the first stege of ttie cascade of FIGURE 1 . which 
is tiie stege represented by BAU 1 0. FIGURE 3a relates to the summing and storage of real infonnation received 

50 by summers 210 and 212 of FIGURE 2. 

In FIGURE 3a, the condition during a recun-ent cyde of 16 clock pulses is tabulated. The first column, desig- 
nat d I. tabulates the dock pulses of tiie recurr nt cycle, as suggested by the h ading "CLOCK PULSE". Th 
s cond column, designated 11, represente ttie memory address of ttie write address pointer (WAP) for memory 
218 (F1) of FIGURE 2 for ach of clock cycles 1...16. The tinird column, designated III. repres nts the write 

55 address pointer (WAP) for memory 220 (F2) of FIGURE 2. Columns II and 111 of FIGURE 3a represent the st ps 
involved in writing into memory ttie real component of ttie outpute of summere 21 0 and 212 of FIGURE 2. The 
fourtti and frftti columns of FIGURE 3a. designated IV and V, are head d "LEFT SUMMER 210" and "RIGHT 
SUMMER 212", resp ctively, and represent ttie summer configuration to accomplish the summing process to 
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be implement d during the various dock pulses of each recurrent cycl by controllable summers 21 0 and 21 2, . 
respectively, of FIGURE 2. For example, during clock pulse 1 of ach recurrent cyde of sixt en dock pulses, 
data which was processed by summer 210 during an eariy portion of dock pulse 1 is stored at the memory 
location designat d 0 in memory 218, and data which was processed by summer 212 is stored in a memory 

5 location designat d WAP F1 + 1 in memory 220. Perfonming the indicated additton, during dock pulse 1 . WAP 
F1 =0. and the sum of 0 + 1 = 1 . Thus, the memory location at which the output of summer 212 (FIGURE 2) is 
stored during dock pulse 1 is memory location 1 . 

Also during dock pulse 1 of the recurrent cycle of 16 dock pulses tabulated in FIGURE 3a, left summer 
210 (FIGURE 2) is configured to add together the BWR data and AR data applied to its inputs as indicated by 

10 the word "ADD" in column IV of FIGURE 3a, while right summer 212 of FIGURE 2 is configured to subtract 
during dock pulse 1, as indicated by "SUB" in column V of FIGURE 3a. 

In a similar manner, during dock pulse 2 of the recun-ent cyde of 1 6 dock pulses tabulated in FIGURE 3a, 
left memory 218 (FIGURE 2) stores at address 1 the result of the subtraction which was performed by left sum- 
mer 210 during an eariy portion of clock pulse 2. Also during dock pulse 2. right memory 220 stores at location 

15 WAP F1 -1 the added signal produced during an eariy portion of dock pulse 2 by summer 212. Memory address 
WAP F1 -1 con^sponds to 1 -1 = 0. Thus, the signals stored in memories 21 8 and 220 during the first two dock 
cydes are stored at reversed addresses. During dock pulse 2. summers 21 0 and 212 are configured to subtract 
and add their input signals, respectively. 

Storage continues in the same manner during subsequent dock pulses, with storage during dock cyde 3 

20 occurring in memory 218 at memory address 2. and at memory address 3 of memory 220. Also during dock 
pulse 3, summer 21 0 adds and summer 21 2 subtracts. The process continues for 16 dock pulses, as specified 
by FIGURE 3a, and then begins again at dock pulse 1 for the next fdlowing cyde of dock pulses. 

FIGURE 3b tabulates control of imaginary components at the outputs of controllable summers 214 and 216 
of FIGURE 2, and of the addresses of memories 222 and 224 in which the summer outputs are stored, by control 

25 236 of FIGURE 2. also for the situation in which the BAU is the first in the cascade of FIGURE 1. FIGURE 3b 
is similar to FIGURE 3a, in that a recurrent cyde of 16 dock pulses is tabulated in cdumn I. Column II. desig- 
nated WAP F1, represents the write address pointer for memory 222. and column III. designated WAP F2. rep- 
resents the write address pointer for memory 224. Column IV. designated "left summer 214" represents the 
contrd condition of summer 21 4 for each dock pulse, and column V. designated "right summer 216" represente 

30 the control condition of summer 216. During clock pulse 1 of a recurrent cyde of 16 dock cydes tabulated in 
column I of FIGURE 3b. left summer 214 is configured to subtract (column IV). while right summer 216 is con- 
figured to add (column V). Also during clock cyde 1. the write address pointer for memory 224 (column II!) 
causes the output of summer 216 to be stored at a memory address designated 0. while, refemng to column 
II, memory 222 stores the output of summer 214 at memory location 0+1 = 1. During dock pulse 2 tabulated 

35 in FIGURE 3b. memory 224 stores at memory location 1 the subtracted output pn^duced by summer 21 6 during 
an eariy portion of the clock pulse, while memory 222 stores at a memory location 1-1 = 0 the added signal 
produced by summer 214. During clock pulse 1 6 (column I of FIGURE 3b). memory 224 stores at address loca- 
tion 15 (column 111) the subtracted output produced by summer 216 during dock pulse 16 (column V). and 
memory 222 stores at address 15-1 = 14 (column I!) the added output produced by summer 214 during dock 

40 pulse 16 (column IV). At the next fdlowing dock pulse, the conditions con^spond to those of dock pulse 1. 
and the cyde repeate. The control conditions during the other dock pulses tabulated in FIGURE 3 will be obvi- 
ous by reference to the FIGURE, without additbnai explanatk>n. 

The contrd sequence Illustrated in FIGURES 3a and 3b does not indude a description of control of niul- 
tiplexers 226. 228. 230 or 232 of FIGURE 2. because multiplexer control Is required only during memory reading 

45 and not during memory writing. 

As an overview of the contnDl of memories and summers tabulated in FIGURE 3a, it is noted that the control 
states simply alternate, i.e.. WAP F1 + 1; WAP F1 - 1; WAP F1 + 1;.... The completion of a cyde of alternation 
occurs after a number of cydes equal to twice the address increment or decrement Thus, in FIGURE 3a the 
addresses are incremented or decremented by 1, and a complete cycle of altemation WAP F1 + 1; WAP F1 - 

so 1 occurs in 2x1=2 dock pulses. There are eight such two-dock-pulse-cydes in the 16 dock pulse recurrence 
cyde tabulated in FIGURE 3a. FIGURE 3b is similar to FIGURE 3a. 

FIGURES 4a and 4b tabulate the write address pointer for m mory pairs 218 and 220, and 222 and 224, 
respectively, and the control of contrdlable summer pairs 210 and 212, and 214 and 216, respectively, for the 
case in which th BAU is in the second stage position occupied by BAU 12 in FIGURE 1 Clock pulses for the 

65 BAU which is in the second stage begin with dock pulse 1 . which is not time-coincident with dock pulse 1 tabu- 
lated in FIGURES 3a and 3b for the BAU in the first stage of the cascade. Clock pulse 1 of FIGURES 4a (and 
4b) occur at the time the first word of processed date arrives at an input of th second stage BAU, delayed 
from the first dock pulse of the first-stage BAU by the processing delay of the first stage BAU. This principle 
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applies to all the stages of processing d scribed b low. 

FIGURE 4a tabulates control of controllable summ rs 210 and 212. and of the writing into memories 218 
and 220, by control 234 of FIGURE 2, for the situation in which the BAU is the second in the cascade of FIGURE 
1. occupying the position of BAU 12. Summers 210 and 212 receive real infonrnatlon AR and BWR. FIGURE 

5 4a thus tabulates the steps involved in writing Into memory the r al component at the outputs of summers 210 
and 212 of FIGURE 2 for the second BAU in the cascade. In FIGURE 4a, during dock pulse 1 tabulated in 
column 1, the output of controllable summer 210 is written into the address of m mory 218 designated 0, as 
indicated by column II of FIGURE 4a, headed "WAP F1". Also during dock pulse 1. the output of summer 212 
of FIGURE 2 is written into an address of memory 220 which is tabulated in column III as "WAP F1 + 2". cor- 

10 responding to 0+2 = 2. Thus, the information written into memory 220 is spaced by two addresses from that 
written into memory 218. Also during clock pulse 1, left summer 210 is configured to add, and right summer 
212 is configured to subtract, as indicated by columns IV and V of FIGURE 4a. During clock pulse 2 tabulated 
in column I of FIGURE 4a, the added signal which is produced at the output of summer 21 0 during clock pulse 
2 is written into address 1 of memory 21 8 (column II). and the subtracted signal from the output of summer 212 

IS is written into address location 1+2 = 3 of memory 220. During clock pulse 2, summer 210 is configured for 
addition (column IV), and summer 212 is configured for subtraction (column V). During dock pulse 3. the sub- 
tracted signal (Column !V) produced at the output of summer 210 during dock pulse 3 is written into address 
2 of memory 21 8 (Column II), while ttie added signal produced at the output of summer 212 (Column V) is written 
into address 2-2 = 0 of memory 220 (Column III). During dock pulse 4, the subtracted signal produced at the 

20 output of summer 21 0 (Column IV) is written Into mentory address 3 of memory 21 8 (Column II). while the added 
signal produced at the output of summer 212 (Column V) is written into menrrary location 3-2 = 1 of memory 
220 (Column III). Control of memory 218. memory 220. summers 210 and 212 continues as designated In the 
corresponding columns of FIGURE 4a for 1 6 dock pulses, and then rettims to the top of the table to begin again 
at clock pulse 1 . 

25 FIGURE 4b tabulates contrd of summers 214 and 216, and mennories 222 and 224 of FIGURE 2. for the 
case in which the BAU is second in the cascade of FIGURE 1. corresponding to BAU locatfon 12. Summers 
214 and 216 of FIGURE 2 receive imaginary Information Al and BWI. During dock pulse 1. the added output 
of summer 216 (Column V) is applied to memory address 0 of memory 224, as indicated by column III, headed 
"WAP F2". and ttie subttacted output of sunurter 214 (Column iV) is applied to memory address WAP F1 + 2 

so = 0+2 = 2 of memory 222, as tabulated in column II of FIGURE 4b. During dock pulse 2. the added signal pro- 
duced during dock pulse 2 by summer 216 (Column V) is stored at memory address 1 of memory 224 (column 
III), and the subtracted signal produced at the output of summer 214 during dock pulse 2 (Column IV) is stored 
at memory address 1+2 = 3 of memory 222 (Column II of FIGURE 4b). The control of summers 214 and 216, 
and the write addressing of memories 222 and 224. continues as tabulated in FIGURE 4b until dock pulse 16 

35 of the recurrent cyde of dock pulses, after which it begins again at dock pulse 1 of the next following cyde. 

Summarizing ttie operatton tabulated in FIGURE 4a, the cydlcal offset of write address locations recurs 
every four clock pulses: WAP F1 + 2; WAP F1 + 2; WAP F1 - 2; WAP F1 - 2;.,.. The four-dock-pulse-cyde 
equals twice the address offset (2) of two address locations, i.e. 4=2x2. Four such four-clock-pulse-cydes occur 
in ttie 16 dock pulse recunrence cyde tabulated in FIGURE 4a. FIGURE 4b is similar to FIGURE 4a. 

40 FIGURE 5a tabulates, for each dock pulse of a recurrent cyde of 16 dock pulses, the state of the write 
address pointere for memories 218 and 220, and the configuration of summere 210 and 212 during operation 
of a BAU such as that of FIGURE 2 in the third position of a cascade of BAUs, conresponding to ttie position 
of BAU 14 of FIGURE 1 . The tabulation of FIGURE 5a relates to the processing of real information. 

As tabulated in FIGURE 5a. during clock pulse 1 (column I) the data from summer 21 0 is written into memory 

.45 address 0 of memory 21 8, as indicated, in the column II. headed "WAP F1 ". Similariy, from column ill for dock 
pulse 1, infonmation from summer 212 is written into memory address WAP F1 + 4 = 0 + 4 = 4. During dock 
pulse 1 . summer 210 is configured to add and summer 212 is configured to subtract, as indicated in Columns 
IV and V. respectively. During dock pulse 2. the infonDation added (Column IV) by summer 210 during dock 
pulse 2 is written into memory 218 at address location 1, as indicated by the numeral 1 in column 11 of FIGURE 

50 5a. Also during dock pulse 2. the subtracted signal produced during clock pulse 2 at the output of summer 212 
(Column V) is written into memory address WAP F1 + 4 = 1+ 4 = 5 of memory 220, as indicated in column III. 
In FIGURE 5a, the write address pointer for memory 218 continually increments during ttie 16 dock pulses, 
taking on values ranging from 0 to 15. The write address pointer for memory 220, as tebulat d in column 3, 
has a value which is the value of the write address pointer of memory 21 8 to which fe ither added or subtracted 

55 the value 4, on a recurrent 4-dock-pulse basis. Thus, during dock pulses 1 ttirough 4. the write address as 
tabulated in column III of FIGURE 5a is "WAP F1 + 4". and during the next four dock pulses, namely dock 
pulses 5-8. the tebulat d value of the write address pointer for memory 220 is listed as "WAP F1 - 4". This Is 
an eight-dock-pulse-cyde. Similariy, during dock pulses 9 through 12, memory 220 writes into address four 
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greater than the address of memory 218 and during the last four dock pulses of the recurrent cycle, namely 
clock pulses 1 3-18, memory 220 is written into at addresses which are four less than the con-esponding address 
of memory 218. Thus, the write address for memory 220 Is four greater or four less than the corresponding 
address of memory 218, cycling at an eight clock pulse rate. The configuration of the left summer 210 and right 
5 summer 212 cycles between add and subtract configuration on the same eight-clock-pulse basis. 

FIGURE 5b tabulates write addresses for memories 222 and 224, and the configurations of summers 214 
and 216, for a BAU occupying the third position in a cascade of BAUs, corresponding to the location of BAU 
14 of FIGURE 1, and for imaginary Information. FIGURE 5b is generally similar to FIGURE 5a. In FIGURE 5b. 
memory 224 is written into at addresses 0. 1 , 2. ...1 5 (column III) over the course of 16 dock pulses of the recur- 
to rent cyde of clock pulses. Memory 222 is written into at addresses either four greater than or four less than 
the conresponding write address of memory 224 (column II). cycling on an eight-dock-pulse basis, just as does 
nnemory 220 as tabulated in column III of FIGURE 5a. Two such eight-dock-pulse cycles occur during the tabu- 
lated 16-dock-pulse recurrent cycle. The tabulations of the configuration of summer 214 and summer 216 in 
columns IV and V of FIGURE 5b Indicate that the configuration remains the same for four clock pulses, and 
15 then changes at each Increment of four dock pulses, just as was the case in FIGURE 5a for sumnriers 210 and 
212. 

FIGURES 6a and 6b tabulate the write pointer address for memory pairs 21 8. 220; and 222. 224, respect- 
ively, and the states of summer pairs 210, 212; 214, 216, respectively, forelock pulses 1 through 16 of a recur- 
rent cyde of 16 dock pulses. The nnemory address pointers for memories 218, 220. 222 and 224, as tabulated 

20 in columns 11 and 111 of FIGURE 6a, and 11 and III of FIGURE 6b, respectively, are each 0 at ttie first dock pulse, 
and increment by unity for each dock pulse through dock pulse 16. One complete cyde of write address poin- 
ters occurs in 16 dock pulses, without increments or decrements. The summer configurations tabulated in col- 
umns IV and V of FIGURES 6a and 6b remain in ttie same condition for the first eight dock pulses of each 
recurrent cycle, and switch to a second condition for the remaining eight dock pulses of each recurrent cyde 

25 of 16 dock pulses. 

Recapitulating tiie cyding of tiie summer configuration and ttie memory address pointer value for BAUs in 
tfie first, second, third and fourth positions in ttie cascade, It is noted that in tiie first position of the cascade a 
complete cyde of alternation occurs in two dock pulses for the first position, in four dock pulses for the second 
position, and eight dock pulses for tiie third position and in 16 dock pulses in the fourth position, while the tenr>- 

30 poral offeet, either plus of minus, for each position is lialf the alternation cyde, i.e. 1 , 2, 4, for ttie first, second 
and third stages, respectively. There is no offset for ttie fburtti stage because the data is written to memory as 
described below and is reconfigured in the memory as required. 

The reading of stored infonmation from memories 21 8. 220; 222, 224 and the multiplexing togetfier of the 
infbnnation read from the memories, is also under the control of controllers 234 and 236 of FIGURE 2. FIGURE 

35 7a tabulates the control states for reading real Information from memories 218 and 220 of FIGURE 2, and for 
multiplexing of the information, for a BAU occupying a cascade position corresponding to that of BAU 10 of 
FIGURE 1 . It should particulariy be noted ttiat FIGURE 7a relates only to the real component of the infonmation 
read from memories 218 and 220 of FIGURE 2 and produced on output data paths 24 and 26 of the BAU. The 
Imaginary component is read from memories 222 and 224. and is controlled as described below in conjunction 

40 witti FIGURE 7b. Column I of FIGURE 7a tabulates the clock pulses. Column II represents the conditions under 
which real data is read to or appears at output data path 26, while column III represents the condittons under 
which a real data component is read to or appears at output data path 24. 

In FIGURE 7a, the column (columns II or III) in which a notation appears indicates which of the two output 
data pattis (path 26 or 24, respectively) it affects, and ttierefbre indicates which of multiplexers 226 or 228 is 

45 affected. For example, column II of FIGURE 7a relates to tiie real component of ttie signal on output data patti 
26, and therefore ttie "File 1" or Tile 2" notations in Cdumn II refer to multiplexer 226. In particular, tiie "File 
1" notation means that multiplexer 226 reads from the left (L) output of memory 218. and the "File 2" notation 
means ttiat multiplexer 226 reads from Left (L) output of memory 220. The register or "REG" notation in elttier 
column II or 111 of FIGURE 7a identifies the address of memories 218 and 220 which is read. As a similar 

50 example, column III of FIGURE 7a relates to the real signal component of output data path 24. and tiierefbre 
its "File 1" or "File 2" notation relates to the condition of multiplexer 228 In reading from the R outputs of 
memories 218 and 220, respectively. Thus, in FIGURE 7a. "File 1" may be interpret d to mean a signal origi- 
nating from memory 218, and "File 2" means a signal originating from memory 220. 

In FIGURE 7a. the tabulation in column I indudes a 16-clock-pulse recurrent cycle as in FIGURES 3-6. 

55 and also indudes information relating to the operation at initial turn-on (dock pulses 1 through 9), when the 
infonmation first begins to flow into ttie cascade over data paths 62 and 64 (FIGURE 1 ). In particular, in FIGURE 
7a. the recurrent portion of the cycle Is ttie portion extending firom dock pulses 10 through dock pulse 25, witti 
ttie return arrow 71 0 representing the retum to tiie condition of dock pulse 1 0 for the beginning of ttie next recurr- 
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ence cycle. During dock pulses 1 through 9, the operation of multiplexers 226 and 228. and the reading of 
memories 218 and 220, may be viewed as being atypical. In particular, during dock pulses 1 through 9. for 
each stag of the cascade, the multiplexers and memories are read and produce outputs at data paths 24 and 
26, but the data is not accepted by the next BAU stage of the cascade, so the memories or the. multiplexers 
5 may be vi w d as b ing inhibited during certain dock pulses. 

The delay of dock pulse 1 as tabulated In column Cdumns II and III of FIGURE 7a is attributable to th 
requirements of the FFT algorithm, while the delay of dock pulses 2 through 9 in column II is attributable to 
the periodoftimerequiredforBdatatoflowfromBdata input tenminal 20 ofFIGURE2to the outputs of sum^^ 

206 or 208 of FIGURE 2. 

10 During dock pulse 2, address or register (REG) 0 of memory 220 (File 2) is read (Column 111 of FIGURE 

7a). The stored data in memory 220 is read at address 0 and appears at both left (L) and right (R) outputs of 
the memory. The notation "File 2" in column III indicates that multiplexer 228 is configured to couple to data 
path 24 the data being read from the R output of memory 220, and is configured so as not to couple to the R 
sideof memory 218. 

IS During dock pulse 3. odumn 11 of FIGURE 7a notes "DELAY", thereby indicating that the real part of signals 
read to output data path 26 are not used by the next stage in the cascade; this may be viewed asjnhibiting 
multiplexer 226. During dock pulse 3. column III (relating to data path 24) bears the notation "File 1", thereby 
indicating that memory 218 is read, and the notation "REG 1" indicates that it is read at address location 1 , and 
memory 218 therefore produces mutually identical output signals at both its L and R outputs. Memories 218 

20 and 220 may be. and in general are. loaded with different data, so the Infbnmation may not be identical as be- 
tween outputs of different memories. 

Refening now to dock pulse 4 tabulated in cdumn I of FIGURE 7a. a "DELAY" notation appeare in cdumn 
II. so multiplexer 226 is viewed as inhibited. The "File 2" notation in Column III means that data is read to output 
data path 24 from the R output of memory 220 and not from the R output of memory 218, and the "REG 2" 

25 notation in column 111 means that memory 220 is read at address location 2. 

From dock pulse 5 to dock pulse 9. column II of FIGURE 7a bears the "DELAY" notation, whereby mul- 
tiplexer 226 continues to be viewed as inhibited. Also during dock pulses 5 to 9, the memory addresses which 
are read increment from address 3 to address 7, Multiplexer 228 (associated with output data path 24) alternates 
its configuration (i.e.. altemates from "File 1" to "File 2") to alternately couple from the R output of memories 

30 218and220. ' 

Beginning with dock pulse 10 of FIGURE 7a. both columns II and 111 of FIGURE 7a bear notations other 
than "DELAY", thereby indicating that both multiplexers produce usable (or used) outputs. In particular, during 
dock pulse 10, multiplexer 226 (because we are reading column II) reads from the L output (File 1) of the 
associated memory, which by reference to FIGURE 2 is seen to be memory 21 8. Memory 21 8 is read at address 

35 location zero (REG 0). Also during clock pulse 1 0. and refening to column III. multiplexer 228 reads from the 
R output of memory 220 (FDe 2) that information which is stored at address location 8 (REG 8). 

During dock pulse 1 1 . reference to column II of FIGURE 7a indicates by the notation "File 2" that multiplexer 
226 reads from memory 220, and by "REG 1 " indicates that memory 220 is read at address locatton 1 . Refer- 
ence to column III indicates by "File 1" that multiplexer 228 reads from memory 218. and by "REG 9", that 

40 address location 9 of memory 218 is read. 

The configuration of the reading of the real component of date from memories 218 and 220 can be under- 
stood for the remaining dock pulses from the above description In conjunction with FIGURE 7a. After dock 
pulse 25 is completed, the configuration retums to that indicated by dock pulse 10. and operation continues 
in a recurrent cyde. 

45 FIGURE 7b is similar to FIGURE 7a in that it relates to a BAU which is the first stege of a cascade, but 
relates to the configurations of multiplexers and memories for the imaginary (as opposed to real) portion of the 
date coupled onto output date paths 24 and 26 of FIGURE 2. As a consequence of representing memory and 
multiplexer configurations which handle the imaginary portions of the date, the notations of FIGURE 7b tebulate 
reading of memories 222 and 224 of FIGURE 2. and the configurations of multiplexers 230 and 232. As in the 

so case of FIGURE 7a. FIGURE 7b includes delays atbibuteble to the FFT algorithm Iteelf ("DELAY(FFT)"). and 
due to processing ("AUM LATENCY^. 

During dock pulse 1 tebulated in FIGURE 7b. botti multiplexers are "inhibit d". as indicated by the "Delay" 
notetion. During dock pulses 2 through 9. signals are not coupled onto date path 26. as indicated by the "delay" 
notetions in column 111. As indicated by ttie "File" and "Reg" notetions in Column II, signals are coupled onto 

55 output date path 24 during dock pulses 1-9. Therefore, during dock pulses 2-9 (where the hyphen represente 
tiie word "through") the notetions "File 1" and "File 2" refer only to multiplexer 230. The multiplexer configuration 
altemates betw en reading memory 222 (File 1) and memory 224 (File 2) during dock pulses 1-9. During dock 
pulse 2. the L output of memory 222 is read at address 0; during dock pulse 3. the L output of memory 224 is 
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read at address 1; and during clock pulse 4, the L output of memory 222 is r ad at address 2. The alternation 
of multiplexer 230 continues to and after dock pulse 10. 

As in the discussion of FIGURE 7a, during dock pulse 10. both multiplexers 230 and 232 produce usable 
outputs. In particular, column II indicates that multiplexer 230 reads address 8 from memory 222 by the L output, 
5 and column III indicates that multiplex r 232 reads address 0 from memory 224. During dock pulse 1 1 . mul- 
tiplexer 230 reads address 9 from memory 224 (Column II), and multiplexer 232 reads address 1 finom memory 
222 (Column III). The remaining configurations for dock pulses 12-25 should be apparent from a comparison 
of the above discussion and FIGURE 7b. After dock pulse 25. the configuratton returns to that of dock pulse 

10 and continues in a recurrent cyde. 

10 The recun^nt cycles described by FIGURES 7a for real data and FIGURE 7b for imaginary data operate 
concurrently, to produce a stream of real and imagaiary data on each of output data paths 24 and 26 after the 
initial start-up period. The stream of data produced on data paths 24 and 26 In accordance with the operation 
described in conjunction with FIGURES 7a and 7b is that which is produced when the BAU is the first in the 
cascade. The operatton of a BAU in the second stage of the cascade is represented by FIGURES 8a and 8b. 

IS FIGURE 8a tabulates the operation of nwmories 218 and 220. and multiplexers 226 and 228 of FIGURE 
2, for produdng real components of data on output data paths 26 and 24 when the particular BAU is second 
in a cascade. Delays attributable to the FFT algorithms inhibit both multiplexers 226 and 228 during dock pulses 

1 and 2. During clock pulses 3-10 as tabulated in column I of FIGURE 8a. column II indicates TDELAY". and 
therefore multiplexer 226 is "inhibited." During dock pulses 3 and 4 of FIGURE 8a, multiplexer 228 reads the 

20 R output of memory 220 ("Ffle 2" notation in column III for both dock pulses 3 and 4). at address 0 during dock 
pulse 3 (REG 0) and at address 1 during dock pulse 4 ("REG 1"). During dock pulses 5 and 6. multiplexer 228 
reads the R output of memoiy 21 8 ("File 1 ") at addresses 2 and 3 ("REG 2 and 3"), respectively. During dodc 
pulses 7 and 8. multiplexer 228 reads the R output of memory 220 (File 2) at locations 4 and 5. respectively, 
and during dock pulses 9 and 10 multiplexer 228 reads the R output of memory 218 at addresses 6 and 7, 

28 respectively. ^ ^, 

Beginning with dodk pulse 1 1 of FIGURE 8a, both multiplexers 226 and 228 of FIGURE 2 produce usable 
outputs. Refemng to column II, multiplexer 226 reads from the L output of memory 218 (Fie 1) at addresses 
0 and 1 during clock pulses 1 1 and 12, respectively, and referring to column III, multiplexer 228 reads from the 
R output of memory 220 (File 2) at address locations 8 and 9. respectively. The reading continues to dock cyde 

30 26. and then returns to the conditions associated wfth dock pulse 1 1 for a recurrent operating cyde of 16 dock 
pulses. Each multiplexer configuration is assumed for two consecutive dock pulses, i.e., during dock cydes 

1 1 and 12 multiplexer 226 remains in the File 1 configuration reading memory 218. and multiplexer 228 remains 
in the File 2 configuration for reading memory 220. The addresses being read increment by one address per 
dock pulse. 

35 FIGURE 8b tabulates ttie operation of memories 222 and 224, and multiplexers 230 and 232 of FIGURE 
2, for produdng imaginary components of data on output data paths 24 and 26 when the particular BAU is sec- 
ond In a cascade. Qock pulses correspond to those of FIGURE 8a. Delays attributable to the FFT algoritiim 
inhibit both multiplexers 230 and 232 during clock pulses 1 and 2. During dock pulses 3-10 as tabulated in 
Column I of FIGURE 8b, Column III indicates "DELAY", and therefore multiplexer 232 Is "inhibited". During dock 

40 pulses 3 and 4 of FIGURE 8b. multiplexer 230 reads the L output of memory 222 ("File 1" notation in Column 
II for both dock pulses 3 and 4), at address 0 during dock pulse 3 (REG 0). and at address .1 during dock pulse 
4 (REG 1). During dock pulses 5 and 6, multiplexer 230 reads the L output of memory 224 (FBe 2) at addresses 

2 and 3 fREG 2" and "REG 3", respectively). During dock pulses 7 and 8. multiplexer 230 reads the L output 
. of memory 222 fFHe 1") at locations 4 and 5. respectively, and during dock pulses 9 and 10 multiplexer 230 

45 reads the L output of memory 224 at addresses 6 and 7. respectively. 

' Beginning with dock pulses 1 1 and 1 2 of FIGURE 8b. both multiplexers 230 and 232 of FIGURE 2 produce 
usable outputs. Referring to Column ill. multiplexer 232 reads from ttie R output of memory 224 at address 
locations 0 and 1 . respectively, and referring to Column II. multiplexer 230 reads from tiie L output of memory 
222 at addresses locations 8 and 9, respectively. The reading continues to dock pulse 26, and then returns to 

50 the conditions assodated with dock pulse 1 1 for a recunnent operating cyde of 1 6 dock pulses. Each multiplexer 
configuration is assumed fortwo consecutive dock pulses, as with FIGURE 8a The addresses being read incre- 
m nt by one address per dock pulse. 

FIGURES 9a and 9b tabulate information relating to multipl x r configuration and memory reading forth 
BAU of FIGURE 2 which is in ttie ttiird stage of the cascade of FIGURE 1 . FIGURE 9a relat s to real information 

55 components coupled onto output data patti 24 and 26. and FIGURE 9b relates to imaginary compon nts. Col- 
umn I of FIGURES 9a and 9b tabulates dock pulses which conrespond to ttiose of FIGURES 7 and 8 in ttie 
manner described above. Column II of FIGURE 9a and column III of FIGURE 9b relate to memories and mul- 
tiplexers which couple onto output data patti 26. and column III of RGURE 9a and column II of FIGURE 9b 
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relate to memories and multiplexers which couple onto output data path 24^ 

In FIGURE 9a. the "DELAY" notation during dock puis s 1-4 indicates that the multiplexers are "inhibited". 

During dock puls^ 5-12, the "DELAY" notation in column II indicates that the multiplexer supplying output data 

path 26 is "inhibited", while the notations in column ill describe the operating state of multiplexer 228. In par- 
5 ticular, during dock pulses 5-8. multiplexer 228 is in the "File 2" state, conresponding to reading from the R 

output of memory 220. The address at which memory 220 is read increm nts by one during each dock cyde. 

During dock cydes 9-12, multiplexer 228 is in the "File 1" state, reading from the R output of memory 218. 

Beginning with dock cyde 1 3. which is the beginning of the recurrent cyde of 1 6 dock pulses, both multiplexers 

take on states which are maintained for periods of four dock pulses. For example, during dock pulses 13-16. 
10 multiplexer 226 remains in the "File 1" state, while multiplexer 228 remains in the "File 2" state. At dock pulse 

17, they switch, and hold their respective states until dock pulse 20. Operation continues to clock pulse 28, 

after which the configuration returns to that of dock pulse 13, thereby beginning anew the recurrent cyde of 

16 dock pulses. 

From the above discussion, in conjunction with FIGURE 9b, the operation of the multiplexers and memones 

IS should be apparent for the Imaginary components of signals on output data paths 24 and 26 of a BAU in the 
third position of a cascade. As in FIGURE 9a, full cydic operation does not begin until dock pulse 13, and the 
multiplexers maintain the same state for four consecutive dock pulses. 

FIGURE 10a tabulates multiplexer states and memory addresses for producing real signal components on 
output data paths 24 and 26 of a BAU which is fourth in the cascade of FIGURE 1. As illustrated in FIGURE 

20 10a, there is no delay, and all outputs are used immediately. The recurrence occurs in 16 pulses. The "FDe 1" 
notation in column II for dock pulses 1-8 indicate that multiplexer 226 reads from the L output of memory 21 8 
for the first eight pulses of the recurrent cyde of pulses. Similariy. the "File 2" notation in column 111 for dock 
pulses 1-8 indicates that multiplexer 228 reads from the R output of memory 220. From dock pulse 9 to 16, 
the multiplexers assume the opposite states. From the above discussion, the meaning of the tabulation of FIG- 

25 URE 10b should be dear. 

As so far described, the cascade of four BAUs may be used for one-, two-, three- or four-stage FFT pro- 
cessing (two-, four-, eight- or sixteen-point FFT). As mentioned, this is perfomied by applying the input signals 
over data paths 6 and 8 of FIGURE 1 , through multiplexer 45 to the inputs of BAU 1 0, by flushing the signal to 
data paths 42 and 44 after the desired number of stages of processing, and by coupling the resulting signals 

30 to memories 50 and 52 for use application to a utilization apparatus. 

Regardless of the number of passes or partial passes through the cascade of four stages, the data initially 
applied to memories 46 and 48 from the extemal source is scrambled before it is applied to BAU 1 0 for an Initial 
step of processing, as described below. 

In particular, for any FFT processing, the incoming data from data paths 6 and 8 is written into, and read 

35 from, memories 46 and 48 in a sequence which results in scrambling of the data words. Such scrambling may 
be accomplished by writing into memory in a scrambled order and reading in a normal sequence (0, 1 . 2. 3...), 
or vice versa. For simplidty. It is assumed that the data arriving at memories 46 and 48 from data paths 6 and 
8 is written into the memories in nomial sequence. Each of memories 46 and 48 may be sakf to contain half 
of the Input data points, with memory 46 containing the first half and memory 48 containing the second half. 

40 Memories 46 and 48 are read In a bit-reversed address sequence (described below), and the data so read is 
applied, by way of multiplexer 45 and data paths 62. 64 to the input ports of BAU 10 to begin a first pass through 
the cascade of BAUs of FIGURE 1 . The bit reversed sequence is generated from a normal monotonic digital 
sequence (expressed as a decimal sequence 0. 1, 2, 3....) by reversing the positions of digits of a particular 
larger significance with the digits of the same lesser significance, as by reversing the MSB and LSB, the 2nd 

45 MSB with the 2nd LSB and so forth. TABLE I Olustrates a bit-reversed sequence for a 3-blt word. 
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TABLE I 





NORMAL" 


SEQUENCE 


keverse 


SEOUENCK 


5 


Decimal 


Binary 


pinary 


jny ^ S 1 

DGCXIaai. 




0 


000 


000 


V 




.1 


001 


100 


•k 




2 


010 


010 


2 


4/) 


3 


Oil 


110 


6 




4 


100 


001 


1 




5 


101 


101 


5 


IS 


,6 


110 


oil 


3 




7 


111 


111 


. 7 



The number of bits in the address sequence depends on the number of words to be stored, and therefore 
20 depends upon the window size. The 3-bit address of TABLE I is suitable for an 8-word window. TABl^ II lists 
the normal and blt-reveised sequences for a 4-blt address sequence (window of 1 6 words.) 

TABLE II 



25 


NORMAL 


SEQUENCE 


BIT-REVERSED SEQUENCE 


Decimal 


^inarY 


Binary 


Decimal 




0 


0000 


0000 


0 




1 


OOOl 


1000 


8 


30 


2 


0010 


0100 


4 




3 


. 0011 


1100 


. 12 




4 


0100 


0010 


2 


35 


5 


0101 


1010 


10 . 




6 


0110 


0110 


6 




7 


0111 


1110 






8 


1000 


0001 


1 


40 


9 


, 1001 


1001 


9 




10 


1010 


0101 


5 




11 


1011 


1101 


13 


45 


12 


1100 


0011 


3 




13 


1101 


1011 


11 




14 


1110 


0111 


7 




15 


1111 


1111 


15 



so 



The bit-reverse reading of memories 46 and 48 to couple data to the cascade for the first pass therethrough 
applies only to the first reading, no matter how many passes through the cascade are required. 

The dimensions of m mories 46. 48 (and 50 and 52) are selected to store 1/2(2N) words, where N is the 
55 maximum number of stages of processing which may be desired. A major advantage of the described arrange- 
ment is that the number of stages of processing may be incr ased simply by increasing the storage capacity 
of the external memories (memories 46, 48. 50 and 52 of FIGURE 1 ) without any change In the amount of stor- 
age capadty within memories 21 8. 220. 222 and 224 of FIGURE 2. 
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The number of bits of th bit-reversed read addresses rr»ust correspond to the number of addresses into 
which data is to be load d. In g n ral, the number of word storage capacity of each memory equals half th 
number of points in the FFT. For example, four-stage processing (16-point FFT) requires that each memory 
46, 48, 50 and 52 have the capacity to store a number of words equal to 16/2 = 8. In order to address eight 
memory locations, the address word must have three bits. Similarly, five-stage processing (32-point FFT) 
requires 32/2 = 18 word storag locations in each external memory 46, 48, 50 and 52, and the address word 
for reading or writing must have four bits. The 3-bit bitiBversed sequence of Table I would be ad quate for 
read addressing the memories for a 16-point FFT, but the four-bit sequence of Table II would be required for 
the 32-point FFT. 

Table III tabulates the bit-reversed sequence for a five-bit binary word, which may be used for a 64-point 
FFT. 



NORMAL SEQUENCE 
Peeinal P?,n^ry 



0 
1 
2 
3 



00000 
00001 
00010 
00011 



TABLE III 

BIT-REVERSED SEQUENCE 
Binary pecjmal 



00000 
10000 
01000 
11000 



0 
16 

8 
24 



28 
29 
30 
31 



11100 
11101 
11110 
11111 



00111 
10111 
01111 
11111 



7 
23 

15 
31 



When one, two, three or four stages of processing (two-, four-, eight- and sbrteen-point FFT, respectively) 
are desired, only one pass through the cascade of BAU pnacessors of FIGURE 1 is required, flushing the data 
past those BAUs whose processing is not required. Processed data appears at data paths 42 and 44 at the 
output of BAU 16. and is to be nr«de available to the utilization apparatus external to FIGURE 1. Processed 
data may become available at data paths 42 and 44 before reading of data from memories 46 and 48 to the 
inputs of BAU 10 is completed. Multiplexer 45 is therefore configured to couple the processed data from data 
paths 42 and 44 to memories 50 and 52. Menwries 50 and 52 are addressed In nomral sequence for writing, 
and are read in due course by the'utilization apparatus, also in nonmal sequence. Thus, memories 50 and 52 
operate as simple buffers when four or less stages of FFT processing are desired. 

When five, six. seven or eight stages of processing are desired (32, 64, 128, 256-point FFT, respectively), 
a first complete pass of the signal through the cascade of four BAUs is performed as described above, and the 
resulting four-stage processed signal is coupled from data paths 42 and 44 to memories 50 and 52, and is stored 
and retrieved in a manner which may result in a shuffling ("reshuffling") of data words. The reshuffling may be 
accomplished by controliably "scrambling" the write addresses and reading sequentially, or by writing sequen- 
tially and reading scrambled addresses. For definiteness, only one scrambling scheme is described, but other 
equivalent methods may be used. 

For processing by more than four stages but less than nine stages (i.e. for five, six, seven or eight stages), 
the data is written from data bus s 6 and 8 into memories 46 and 48 in normal sequence as describ d above, 
and is read by a bit-reversed sequence. The "bit-revereed" data is applied to BAU 10 as described above. After 
four stages of processing, data is written from BAU 16 into memories 50 and 52. in another scrambi d or 
"reshuffled" sequ nee. which is describ d below and which is different from the above-described bit-revere 
sequence. The data which is written into memories 50 and 52 with "reshuffled" addresses is read with nomnal 
sequence read addresses. The reshuffled infonnation read from memories 50 and 52 is directed by multiplexer 
45 to data paths 62 and 64 for coupling to ttie input ports of BAU 1 0 to begin a second pass through the cascad 
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of BAUs of FIGURE 1 . The second pass may result in a total of five stages of FFT processing (32-point FFT) 
If the second pass through the cascade includes processing only by BAU 10 and flushing past BAUs 12. 14 
and 1 6. Additional processing by BAUs 12. U and 16 results in six, seven or ight-stage processing (64-. 128- 
^ or 256-point FFT). Since memories 50 and 52 may still be actively reading data to BAU 1 0 at the time at which 

5 processed data appears at data paths 42 and 44. the complet d data is applied to memories 46 and 48. The 
data is written into memories 46 and 48 In scrambi d sequence. 

FIGURE 11a Illustrates a flow chart applicable for generating memory write addresses for 32-point and 
more FFT (5 and more stages) processing. This flow chart, when applied to a 32-point FFT (N=5) happens to 
produce a normal sequence. The sequence of memory addresses generated by the flow chart of FIGURE 1 1 a 

10 is used to write data into memories 50 and 52 after the first pass of data through the cascade of BAUs of FIGURE 

1. . . 

The complete processing sequences for flve-stage processing (32-point FFT). then, is (a) load memones 
46 and 48 In normal sequence from the source of signal; (b) read memories 46 and 48 with bit-reversed 4-bit 
addresses; (c) apply the data read from memories 46 and 48 to BAU 10, and process the data through BAUs 

IS 10, 12, 14 and 16; (d) write the processed data from BAU 16 into memories 50 and 52 in nonntial sequence; (e) 
read data from memories 50 and 52 in nomial sequence; (f) apply the data read from memories 50 and 52 to 
the input of BAU 10 for processing; (g) flush the processed output of BAU 10 past BAUs 12, 14 and 16; and 
(h) write the processed, flushed data into memories 46 and 48 In normal sequence, whereupon the 32-point 
FFT is avaOable in memories 46 and 48. where it may be accessed by a utilization apparatus. 

20 FIGURE 1 la is a flow chart illustrating a method for generation of scrambled intennediate addresses. The 
addresses are termed "intennediate" because they are used at a stage of processing after the firet use of scrann- 
bled addressing (the bit-reversed addresses used at the initiation of processing) and before the last possible 
use of scrambled addressing, described below. The logic of FIGURE 1 la is initiated at a START block 1110, 
and flows to a block 1112. representing the reading of the desired FFT size. FIGURE 11a applies to FFT sees 

25 equal to or greater than 5 stages (32-point). The size is expressed as 2n, where N represents the number of 
stages of processing. The logic flows to block 1114. which represents the setting of a variable INCrement to 
equal 2^/26. The exponent 5 relates to the position of BAU 1 0 as the 5th stage of processing on a second pass 
through the cascade of BAUs. Block 1116 represents the setting of a running variable I to zero, and block 1118 
sets a second running variable, J. to a value of zero. The logic flows to a memory address generator, represen- 

30 ted as a block 1 120. which establishes the cun^nt address by adding the cunrent value of running vanaWe I to 
the product of running variable J and INCrement (J • INC) + 1. The address is made avanable at an output node 
1 121 , and the logic flow loops back by way of a path 1122 to a further block 1 124. at which the current value 
of running variable J is incremented by 1 . The new value of running variable J is compared with a limit value 
of J In decision block 1126. The limit value of 16 is derived from the expression (2* = 16), where the exponent 

35 4 relates to the number of BAUs. If J has not reached the limit value of 16. the logic leaves decision block 1 126 
by the NO path, and flows by path 1128 back to block 1120. which again generates an address equal to 
(J*INC)+I. ft will be dear that a loop is fomed by which block 1 1 20 produces addresses having an initial valu 
of I. incrementing by INC to an upper value set by decision block 1 126. When the upper limit is reached in decK 
sion block 1 126, the logic leaves block 1 126 by the YES output and path 1 1 30. and reaches a block 1 1 32. rep- 

40 resenting the incrementing of running variable I. Decision block 1 1 34 compares the current value of I to INC. 
So long as I is less than INC. the logic flow leaves decision block 1134 by the NO exit, and proceeds by path 
1 136 back to block 1 1 1 8. In block 1 1 1 8. J is again set to zero. These logfe connections set up nested loops by 
which sequences of addresses are generated, which start with I and increment by INC, with each sequence 
starting with a new value of 1 (see FIGURES 1 1 b and 1 1 c). When the full sequence has been generated, dech 

*s sionblock1134routesthelogicflowtoaSTOPblock1138. 

As mentioned, when the logic of FIGURE 1 1 a is used to generate addresses for 5-stage processing (32- 
point FFT), the addresses which are produced at node 1121 happen to be in normal sequence. This results 
because 2N/25 in block 1 1 14 has a value of unity, and the increment is therefore t)y ones. In the first pass through 
the logic, block 1 120 starts with an initial value I of zero, and muftiples INC by zero. Decision block 1 126 is 

50 satisfied when J=16. 

For a 64-point FFT (six-stage processing), data is initially loaded in nomial sequence into memones 46 
and 48, and read in a bit-reversed sequence as described above. The data r ad from memories 46 and 48 is 
coupled through multiplexer 45 and data paths 62 and 64 to BAU 10 to begin a firet pass through the cascade. 
At the completion of the first pass, the data produced on data paths 42 and 44 is written into memories 50 and 

55 52 in scrambled order, generated as described above by the flow chart of FIGURE 1 la. For a 64-point FFT. 
N=6. so INC is set to a value of two in block 1 1 14 of FIGURE 11a. The sequence of memory addresses is illus- 
trated in FIGURE 1 1 b. Following the writing into memories 50 and 52 in scrambled sequenc s the intermediate 
data is read from memories 50 and 52 in nomial sequence for application through multiplexer 45 and data paths 
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62 and 64 of FIGURE 1 back to the inputs of BAU 10. The data is processed a second time by BAU 10 and by 
BAU 12, and the processed data is then flushed past BAUs 14 and 16 for application by way of data paths 42 
and 44 to memories 46 and 48. The data is writt n into memories 46 and 48 in a "final" sarambled sequence, 
generated by the flow chart Illustrated in FIGURE 12a. FIGURE 12b Hlustrates the final address sequence for 

5 a 64-point FFT. The scrambi d final s quence Is used for writing process d data from data paths 42 and 43 
into memories 46, 48 after the second pass through the cascade of BAUs. 

FIGURE 12a is simflar to, but not identical with, FIGURE 11a. Elements of FIGURE 12A corresponding to 
those of FIGURE 1 1 a are designated by the same reference numerals, and similar elements which are not iden- 
tical to those of FIGURE 11a are designated by the same reference numbers in the 1200 series, rather than 

10 in the 1 100 series. FIGURE 12 is used to generate write addresses for writing into memories 46 and 48 after 
the second pass through the cascade of BAUs of FIGURE 1 during 5-stage to 8-stage (32 to 256 point) pro- 
cessing. It is also used for 9-stage processing, for writing into memories 46 and 48 afterthe second pass through 
the cascade. 

In FIGURE 12a. biock 1220 generates memory addresses equal to I + (16 X J), decision block 1 226 com- 

15 pares running variable J with variable INC. and decisfon block 1234 compares the cunrent value of I with 16. 
With these changes to the flow chart, the anrangement of FIGURE 12a produces addresses in the order illus- 
trated in FIGURES 12b. 12c and 12d for sbc-, seven-, and eight-stage processing, respectively, corresponding 
to 64, 128 and 256-point FFT processing, respectively. The arrangement of FIGURE 12a produces a nonmal 
sequence for 5-stage processing (32-point FFT). 

20 For a 1 28-point FFT (N=7), seven stages of BAU processing is required. Processing begins by writing data 
into memories 46 and 48 in normal sequence. The data is read from memories 46 and 48 in bit-reversed sequ- 
ence, as described above, for application to the input ports of BAU 1 0 to begin a first pass through the cascade. 
When the first pass is completed, the processed data being produced by BAU 16 is coupled by way of data 
paths 42, 44 and multiplexer 45 to memories 50. 52. The data is written into memories 50, 52 in the scrambled 

25 sequence generated by the flow chart of FIGURE 1 1 a. with the value of N set to 7. With this value of N, the 
sequence of write addresses is fllustrated in FIGURE 11c. The data is written into memories 50, 52 with the 
scrambled sequence, and *en read in nonmal sequence for application, by way of multiplexer 45 and data paths 
60 and 62, back to the input ports of BAU 10 to begin the second pass through the cascade. The data is pro- 
cessed through BAUs 10. 12 and 14. and is flushed past BAU 16 to data paths 42 and 44. thereby completing 

30 the second and final pass through the cascade. The processed data is written into memories 46 and 48 in a 
scrambled sequence generated by the flow chart of FIGURE 12, for a value of N set to 7. With this value of N, 
the address sequence is set forth in FIGURE 12c. 

For eight stages of processing (256-point FFT). the data is initially applied from data paths 6. 8 to memories 
46 and 48, and written into the memories in nomnal sequence. The data is read from memories 46 and 48 in 

35 bit-reversed sequence, and applied to BAU 1 0 to begin processing. After four stages of processing, the partially 
processed data is written into nrremories 50 and 52 in the scrambled sequence of FIGURE lid. generated by 
the flow chart of FIGURE 11a. The data Is read from memories 50 and 52 in normal sequence, and is applied 
to BAU 10 to begin a second pass through the cascade. At the completion of the second pass through the cas- 
cade of BAUs, the data is. written into memories 46 and 48 in a second scrambled sequence generated by the 

40 flow chart of FIGURE 1 2a, which sequence is illustrated in FIGURE 12d. The data ttien becomes available to 
the utilization apparatus. 

For more ttian eight stages of processing (i.e.. for nine through 12 stages of processing), two complete 
passes through the cascade of BAUs. plus a third partial or complete pass, are required. The initial reading of 
data from memories 46 and 48 in all cases is by bit-reversed sequence, described above. 

45 For nine stages of processing, FIGURES 13a. 13b and 13c illustrate the first intennediate, second inter- 
mediate, and final data reshuffling addresses. The first intenmediate addresses of FIGURE 1 3a are used to write 
data into memories 50 and 52 following the first pass through the cascade, the second intermediate addresses 
of FIGURE 13b are used to write data into memories 46 and 48 following the second complete pass, and the 
final addresses of FIGURE 13c are for writing data into memories 50 and 52 when processing is complete. 

50 The firet and second intermediate, and final reshuffling addresses for ten stages of processing (1024-point 
FFT) are illustrated in FIGURES 14a, b, and c. respectively. For any number of stages of processing, the men>- 
ory addresses following the first pass ttirough the cascade are generated as d scribed in conjunction with FIG- 
URE 11a. 

Other embodim nts of the invention will be apparent to those skOI d in the art For example, while the 
55 anBngement of FIGURE 12 is good up to nine stages, it may be expanded every four additional stages, by 
adding an additional DO loop to the flowcharL 
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Claims 

1. A butterfly arithmetic unit adapted for use in a cascade of similar arithmetic units in a pipeline FFT pro- 
cessor, and adapted for accepting a stream of complex first and second infomnation signals and complex 

5 weighting signals, said first and second infomnation signals representing successive samples of a time- 

varying quantity, said arithm tic unit comprising: 

complex multiplying means coupled for receiving said first and weighting signals for multiplying at 
least a component of said first infbmiation signals by said weighting signals for generating real and imagi- 
nary portions of a multiplied signal; 

id first summing means coupled to said complex multiplying means for generating as a summed output 

signal one of the sum and the difference of saki real and imaginary portions of said multiplied signal; 

controllable second summing means including a first input port coupled to said first summing means 
for receiving said summed output signal therefrom, a second inpirt port coupled for receiving at least a 
component of said second information signals, and also including a control input port, for cbntrollably sunrv 

15 ming, by one of addition and subtraction, one of said real and imaginary portions of said second Information 

signal with said summed output signal from said first summing means to form a first controllably summed 
signal; 

controllable third summing means including a first input port coupled to said first summing means 
for receiving said summed output signal therefrom, a second input port coupled for receiving at least a 
20 component of said second information signals, and also Including a control input port, for controllably sum- 

ming, by one of addition and subtraction, one of said real and Imaginary portions of said second Information 
signal with said summed output signal flnom said first summing means to fom a second oontrollably sunn- 
med signal; 

controllable first storage means including an input port coupled to said second sunrtmlng means, 

25 and also including first and second output ports, for temporarily storing said first controllably summed sig- 

nals at storage locations, said first storage means being capable of being read for thereby producing left 
and right components of first stored Infomiation signals at said left and right output ports, respectively, of 
said first storage means; 

controllable second storage means including an input port coupled to said third summing means. 

30 and also including first and second output ports, for temporarily storing said second controllably sumnried 

signals at storage locations, said second storage means being capable of being read for thereby producing 
left and right components of second stored information signals at said left and right output ports, respect- 
ively, of said second storage means; 

controllable first multiplexing means including a first input port coupled to said left output port of 

35 said first storage means and a second input port coupled to said left output port of said second storage 

means, and also including an output port, for coupling to said output port of said first multiplexing means 
said left component of one of said first and second stored information signals; 

controllable second multiplexing means including a first input port coupled to said right input port 
of said first storage means and a siecond input port coupled to said right output port of said second storage 

40 means, and also Including an output port, for oDupling to said output port of said second multiplexing means 

said right component of one of said first and second stored information signals, respectively; and 

control means coupled to said second and third summing means, to said first and second storage 
means, and to said first and second multiplexing means, for causing said controllable second and third 
summing means to switch their summing from addition to subtraction in a cyclical mannen which depends 

45 upon the location of said arithmetic unit in said cascade, for causing said first and second storage means 

to write their incoming controllably summed signals cyclically into selected ones of said storage locations 
relatively separated by an integer number of storage locations, which integer depends upon the said posi- 
tion of said arithmetic unit in said cascade, and which integer may be zero. 

60 2. A butterfly arithmetic unit for use In a pipeline FFT processor which includes a cascade of plural similar 
arithmetic units, each arithmetic unit being adapted to receive a stream of sets of first and second infor- 
mation woixls tog ther with a con-esponding stream of weighting words, each of said first and second infor- 
mation words and said weighting words including real and imaginary components, said butterfly unit 
comprising: 

55 multiplying means for multiplying corresponding ones of weighting and first information words 

together to produce real and imaginary components of weighted first infonmation words; 

first summing means coupled to said multiplying means for receiving said real component of saki 
weighted first Information words therefirom, and also coupled for receiving at least said real component of 
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corresponding ones of said second infonmation words, for sunfiming said real components of said s cond 
information words and said weighted first information words to simultaneously fonm at least two real inter- 
mediate sum words; 

first storage means coupled to said first sunrwning means for temporarily storing said two real inter- 
m diate sum words, said first storage means simultaneously generating left and right components of first 

and second stor d words; 

second summing means coupled to said multiplying means for receiving said imaginary component 
of said weighted first information words therefrom, and also coupled to receive at least said imaginary conv 
ponent of corresponding ones of said second infonmation words, for summing said imaginary components 
of said second information and said weighted first infomnation words to simultaneously form at least two 
imaginary intermediate sum words; 

second storage means coupled to said second summing means for temporarily storing said two 
imaginary intennediate sum words, said second storage means simultaneously reading left and right com- 
ponents of third and fourth stored words; 

first multiplexing means coupled to said first storage means and to a first output port of said butterfly 
arithmetic unit for simultaneously coupling to said first output port one of first and second word combi- 
nations, said first word combination being the combination of said left component of said first stored word 
and said left component of said second stored word, and said second word combination being the com- 
bination of said right component of said first stored word and said right stored word; 

second multiplexing means coupled to said second storage means and to a second output port of 
said butterfly arithmetic unit for simultaneously coupling to said second output port one of third and fourth 
word combinations, said third signal combination being the combination of said left component of said third 
stored word and said left component of said fourth stored word, and said fourth word combination being 
the combination of said right component of said third stored word and said right component of said fourth 
stored word. 

A pipeline Fast Fourier transfomi processor array, adapted for processing a stream of complex first and 
second information signals representing successive samples of a time-varying quantity, said array com- 
prising: 

a controllable source of complex weighting signals; 

a first butterfly arithmetic unit including complex multiplying means including first and second input 
ports coupled for receiving said first and weighting signals, respectively, for multiplying said first infor- 
mation signals by said weighting signals for generating real and imaginary portions of a multiplied signal, 
first summing means coupled to said complex multiplying means for generating as a summed output signal 
one of the sum and the difference of said real and imaginary portions of said multiplied signal, controllable 
second summing means including a first input port coupled to said first summing means for receiving said 
summed output signal therefrom, a second input port, coupled for receiving at least a component of said 
second information signals, and also including a control input port, for controllably summing, by one of 
addition and subtraction, one of said real and imaginary portions of said second infomrtation signal with 
said summed output signal from said first summing means to form a first controllably summed signal con- 
trollable third summing means including a first input port coupled to said first sunmning means for receiving 
said summed output signal therefrom, a second input port coupled for receiving at least a component of 
said second information signals, and also including a control input port, for controllably summing, by one 
of addition and subtraction, one of said real and Imaginary portions of said second informatton signal with 
said summed output signal from said first summing means to fonm a second controllably summed signal 
controllable first storage means including an input port coupled to said second summing means, and also 
including first and second input ports, for temporarily storing said first controllably summed signals at stor- 
age locations, said first storage means being capable of being read simultaneously at two storage locations 
for thereby producing first left and right stored infonmation signals at said left and right output ports, re- 
spectively, of said first storage means controllable second^storage means including an input port coupled 
to said third summing means, and also including first and second input ports, for temporarily storing said 
second controllably summ d signals at storage locations, said second storage means being capable of 
being read simultaneously at two storage locations for thereby producing left and right components of sec- 
ond stored information signals at said left and right output ports, respectively, of said second storage 
means controllable first multiplexing means Including a first input port coupled to said left output port of 
said first storage means and a second input port coupled to said left output port of said second storage 
means,and also including an output port, for coupling to said output port of said first multiplexing means 
said left component of one of said first and second stor d infonmation signals controllable second multi- 
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plexing means induding a first input port coupled to said right input port of said first storage nneans and 
as second input port coupled to said right output port of said second storage m ans, and also including 
an output port, for coupling to said output port of said second multiplexing means said right component 
of one of said first and s cond stored information signals, respectively; and control m ans coupled to said 

6 - second and third summing means, to said first and second storage means, and to said first and second 
multiplexing means, for causing said controllable second and third summing means to switch th ir sum- 
ming from addition to substraction in a cyclical manner which depends upon the location of said arithmetic 
unit in said cascade, for causing said first and second storage means to write their incoming controllably 
summed signals cyclically into selected ones of said storage locations relatively separated by an integer 

10 number of storage locations, which integer depends upon the said portion of said arithmetic unit in said 

cascade, and which integer may be zero, for causing said first and second storage means to read alter- 
nately, and for causing said first and second storage means when used to read firom storage location pairs, 
which pairs depend upon said position of said arithmetic unit in said cascade; 

second, third and fourth buttertiy arithmetic units similar to said first butterfly arittimetic unit, said 

IS second, third and fourth butterfly arithmetic units being cascaded, said complex multiplying means of each 

. butterfly arithmetic unit being coupled to said output port of said second multiplexing means of the pre- 
ceding butterfly arittimetic unit for reoewing said right component of one of said first and second stored 
information signals of said preceding butterfly arittwnetic unit as said first Information signal, and wKh said 
second input ports of said second and tiiird summing means of each of said second, ttiird and fourth but- 

20 terfly arithmetic units coupled to said output port of said first multiplexing means of ttie preceding butterfly 

arittimetic unit for receiving said left component of one of said first and second stored Information signate 
of said preceding butterfly aritiimetic unit as said second information signal; and 

means coupled to said control means of said first, second, ttiird and fourth butterfly arittimetic units 
for providing information relating to tineir location In said cascade. 



25 



L A pipeline Fast Fourier transform processor according to daim 3. further ownprising storage means coup- 
led to said output ports of said first and second multiplexing means of said fourth butterfly arithmetic unit, 
to said first input port of said oamplex multiplying means, and to said second input port of said controllable 
second summing means of said first butterfly arithmetic unit, for storing signals processed by said fourth 
30 butterfly arittimetic unit, and for coupling said signals processed by said fourth butterfly arittimetic unit back 
to said first butterfly aritiimetic unK for further processing. 

5. A pipeline Fast Fourier tiansfbim processor according to daim 4, further comprising scrambled addressing 
means for addressing said storage means in orders which differ as between reading and writing, whereby 
35 said signals processed by said fourth butterfly aritiimetic unit are reshuffled before being coupled back to 

said first butterfly arittimetic unit 
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